Improving Data Locality Using Dynamic Page Migration Based on Memory Access Histograms
نویسندگان
چکیده
This Paper presents an approach which dynamically and transparently improves the data locality of memory references in Non-Uniform Memory Access (NUMA) characterized systems. The approach is based on run-time data redistribution via user-level page migration. It uses memory access histograms gathered by hardware monitors to make correct decisions related to the placement of shared data. First performance experiments on several applications show the potential for a significant gain in speedup. In addition, a graphical user interface has been developed showing the actual data movement thereby helping the user to understand the behavior of the application and to detect performance bottlenecks. This feature complements an already existing Data Layout Visualization tool for the observation of memory locality.
منابع مشابه
Geographical Locality and Dynamic Data Migration for OpenMP Implementations of Adaptive PDE Solvers
On cc-NUMA multi-processors, the non-uniformity of main memory latencies motivates the need for co-location of threads and data. We call this special form of data locality, geographical locality. In this article, we study the performance of a parallel PDE solver with adaptive mesh refinement. The solver is parallelized using OpenMP and the adaptive mesh refinement makes dynamic load balancing n...
متن کاملScheduler-Activated Dynamic Page Migration for Multiprogrammed DSM Multiprocessors
The performance of multiprogrammed shared-memory multiprocessors suffers often from scheduler interventions that neglect data locality. On cachecoherent distributed shared-memory (DSM) multiprocessors, such scheduler interventions tend to increase the rate of remote memory accesses. This paper presents a novel dynamic page migration algorithm that remedies this problem in iterative parallel pro...
متن کاملA Study of Implicit Data Distribution Methods for OpenMP Using the SPEC Benchmarks
In contrast to the common belief that OpenMP requires data-parallel extensions to scale well on architectures with non-uniform memory access latency, recent work has shown that it is possible to develop OpenMP programs with good levels of memory access locality, without any extension of the OpenMP API. The vehicle for localizing memory accesses transparently to the programming model, is a runti...
متن کاملMemory Migration on Next-Touch
NUMA abilities such as explicit migration of memory buffers enable flexible placement of data buffers at runtime near the tasks that actually access them. The move_pages system call may be invoked manually but it achieves limited throughput and implies a strong collaboration of the application. Indeed, the location of threads and their memory access patterns must be carefully known so as to dec...
متن کاملOdin: Design and Evaluation of a Single Address Space Multiprocessor
Odin is a new high performance single address space multiprocessor design. The contribution of this investigation is the synthesis of three important new methods into a unified system which maximises data locality and significantly reduces data access latencies. To achieve high performance Odin uses a segmented stack to maintain data locality after thread migration, and a memory mapping that di...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002